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Abstract 

We resolve the question of whether Fourier sampling can efficiently solve the hidden subgroup problem. 
Specifically, we show that the hidden subgroup problem over the symmetric group cannot be efficiently 
solved by strong Fourier sampling, even if one may perform an arbitrary POVM on the coset state. These 
results apply to the special case relevant to the Graph Isomorphism problem. 



1 Introduction: the hidden subgroup problem 

Many problems of interest in quantum computing can be reduced to an instance of the Hidden Subgroup 
Problem (HSP). We are given a group G and a function / with the promise that, for some subgroup H C G, 
f is invariant precisely under translation by H: that is, / is constant on the cosets of H and takes distinct 
values on distinct cosets. We then wish to determine the subgroup H by querying /. 

For example, in Simon's problem . G = 1^ an£ l / is an oracle such that, for some y, f(x) = fix + y) 
for all x; in this case H = {0, y} and we wish to identify y. In Shor's factoring algorithm G is the group 
Z* where n is the number we wish to factor, /(x) = r x mod n for a random r < n, and H is the subgroup 
of Z* whose index is the multiplicative order of r. Both Simon's and Shor's algorithms use the following 
approach, referred to as the standard method or Fourier sampling 

Step 1. Prepare two registers, the first in a uniform superposition over the elements of G and the second 
with the value zero, yielding the state 
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Step 2. Query (or calculate) the function / defined on G and XOR it with the second register, 
entangles the two registers and results in the state 
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Step 3. Measure the second register. This puts the first register in a uniform superposition over one of /'s 
level sets, i.e., one of the cosets of H, and disentangles it from the second register. If we observe the 
value /(c), we have the state V3 ® l/( c )) where 



where the sum includes one representative c for each of TJ's cosets. 
Step 4. Carry out the quantum Fourier transform on -03 and measure the result. 

(Note that in Shor's algorithm, since |Z*| is unknown, the Fourier transform is performed over 7L q for some 
q = poly(n); see [32] or OUT].) 

In both Simon's and Shor's algorithms, the group G is abelian; it is not hard to see that, in this abclian 
case, a polynomial number (i.e., polynomial in log |G|) of experiments of this type determine H. In essence, 
each experiment yields a random element of the dual space H 1 - perpendicular to iJ's characteristic function, 
and as soon as these elements span H we are done. 

While the nonabelian hidden subgroup problem appears to be much more difficult, it has very attractive 
applications. In particular, solving the HSP for the symmetric group S n would provide an efficient quantum 
algorithm for the Graph Automorphism and Graph Isomorphism problems (see e.g. Jozsa ^7] for a review). 
Another important motivation is the relationship between the HSP over the dihedral group with hidden shift 
problems @] and cryptographically important cases of the Shortest Lattice Vector problem ,2& . 

So far, algorithms for the HSP arc only known for a few families of nonabelian groups, including wreath 
products Z§ I 1i ; more generally, semidirect products K x Z?; where K is of polynomial size, and 
groups whose commutator subgroup is of polynomial size ^Sj; "smoothly solvable" groups [Zj; and some 
semidirect products of cyclic groups (Tl|. Ettinger and H0yer [3] provided another type of result, by showing 
that Fourier sampling can solve the HSP for the dihedral groups D n in an information-theoretic sense. That 
is, a polynomial number of experiments gives enough information to reconstruct the subgroup, though it is 
unfortunately unknown how to determine H from this information in polynomial time. 

To discuss Fourier sampling for a nonabelian group G, one needs to develop the Fourier transform over 
G which relics on the group's linear representations. For abelian groups, the Fourier basis functions are 
homomorphisms <j> : G — > C such as the familiar exponential function 4>k{x) = e 27r * kx / n for the cyclic group 
Z„. In the nonabelian case, there are not enough such homomorphisms to span the space of all C- valued 
functions on G; to complete the picture, one introduces representations of the group, namely homomorphisms 
p : G — > U(V) where U(V) is the group of unitary matrices acting on some C- vector space V of dimension 
dp. It suffices to consider irreducible representations, namely those for which no nontrivial subspace of V is 
fixed by the various operators p(g). Once a basis for each irreducible p is chosen, the matrix elements pij 
provide an orthogonal basis for the vector space of all C-valued functions on G. 

The quantum Fourier transform then consists of transforming (unit- length) vectors in C[G] = 
{J2g£G a g \d) I a g £ from the basis {\g) | g 6 G} to the basis {\p, where p is the name of an 
irreducible representation and 1 < i,j < d p index a row and column (in a chosen basis for V). Indeed, 
this transformation can be carried out efficiently for a wide variety of groups [21 El El- Note, however, 
that a nonabelian group G does not distinguish any specific basis for its irreducible representations which 
necessitates a rather dramatic choice on the part of the transform designer. Indeed, careful basis selection 
appears to be critical for obtaining efficient Fourier transforms for the groups mentioned above. 

Perhaps the most fundamental question concerning the hidden subgroup problem is whether there is 
always a basis for the representations of G such that measuring in this basis (in Step 4, above) provides 
enough information to determine the subgroup H . This framework is known as strong Fourier sampling. 




Alternately, we can view the first register as being in a mixed state with density matrix 
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In this article, we answer this question in the negative, showing that natural subgroups of S n cannot be 
determined by this process; in fact, we show this for an even more general model, where we perform an 
arbitrary positive operator- valued measurement (POVM) on coset states \cH). We emphasize that the 
subgroups on which we focus are among the most important special cases of the HSP, as they are those to 
which Graph Isomorphism naturally reduces. 

Related work. The terminology "strong Fourier sampling" [§] was invented to distinguish this approach 
from the natural variant, called weak Fourier sampling, where one only measures the name of the represen- 
tation p, and ignores the row and column information. Weak Fourier sampling is basis-independent, making 
it attractive from the standpoint of analysis; however, it cannot distinguish conjugate subgroups from each 
other, and Hallgren, Russell and Ta-Shma ^21 showed that it cannot distinguish the trivial subgroup from 
an order-2 subgroup consisting of n/2 disjoint transpositions. Specifically, they used character bounds to 
show that the probability distribution obtained on representation names for the trivial and order-2 subgroups 
are exponentially close in total variation distance: it requires an exponential number of such experiments 
to distinguish them. Kempe and Shalev [T^] have generalized this result to other conjugacy classes, and 
conjecture that one can do no better than classical computation with this approach. 

In an effort to shed light on the power of strong Fourier sampling, Grigni, Schulman, Vazirani and 
Vazirani [§] showed that, for groups such as S n , measuring in a random basis yields an exponentially small 
amount of information. This can be explained, roughly, by the fact that projecting a vector into a sufficiently 
high-dimensional random subspace results in tightly concentrated length. On the other hand, Moore, Rock- 
more, Russell and Schulman |25| showed that for the affine and q-hedral groups, measuring in a well-chosen 
basis can solve the HSP (at least information-theoretically) in cases where random bases cannot. 

Our contribution. In this paper we show that strong Fourier sampling, in an arbitrary basis of the 
algorithm designer's choice, cannot solve the HSP for S n . As in ^21 we focus on order-2 subgroups of the 
form {l,m} where m is an involution consisting of n/2 disjoint transpositions; we remark that if we fix 
two rigid connected graphs of size n/2 and consider permutations of their disjoint union, then the hidden 
subgroup is of this form if the graphs are isomorphic and trivial if they are not. Then we show that strong 
Fourier sampling — and more generally, arbitrary measurements of coset states — cannot distinguish these 
subgroups from each other, or from the trivial subgroup, without an exponential number of experiments. 

We remark that our results do not preclude the existence of an efficient quantum algorithm for the 
HSP on S n . Rather, they force us to consider multi-register algorithms, in which we prepare multiple coset 
states and subject them to entangled measurements. Ettinger, H0yer and Knill j^j showed that the HSP 
on arbitrary groups can be solved information-theoretically with a polynomial number of registers, and the 
authors have shown how to carry out such a measurement in the Fourier basis |21| . Kuperberg |2l)| devised 
a subexponential (2°^ l ° sn ^) algorithm for the HSP on the dihedral group D n that works by performing 
entangled measurements on two registers at a time, Bacon, Childs, and van Dam have determined the 
optimal multiregister measurement for the dihedral group, and the present authors have generalized this to 
all cases where the H and G form a Gel'fand pair |22| . 

Whether a similar approach can be taken to the symmetric group is a major open question. In a 
companion paper |23| . the first two authors take a step towards answering this question by showing that if 
we perform arbitrary entangled measurements over pairs of registers, distinguishing H = {l,m} from the 
trivial group in S n requires a superpolynomial number (specifically, ^(v^/Jos™)) Q f experiments. 

2 Fourier analysis over finite groups 

We briefly discuss the elements of the representation theory of finite groups. Our treatment is primarily for 
the purposes of setting down notation; we refer the reader to (HIE] for complete accounts. 

Let G be a finite group. A representation p of G is a homomorphism p : G — > U(V), where V is a finite 
dimensional Hilbert space and U(V) is the group of unitary operators on V. The dimension of p, denoted 
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dp, is the dimension of the vector space V. By choosing a basis for V, then, each p(g) is associated with a 
unitary matrix \p(g)] so that for every g,h G G, [p(gh)\ — [p(g)\ ■ [p{h)\ where • denotes matrix multiplication. 

Fixing a representation p : G — > U(T^), we say that a subspace W C V is (left) -invariant if /^(g)^ C W 
for all g G G; observe that in this case the restriction pw ■ G — > U(W), given by restricting each p{g) to 
W , is also a representation. When p has no invariant subspaces other than the trivial space {0} and V, p 
is said to be irreducible. When p is irreducible, Schur's lemma asserts that the centralizer of the subgroup 
impc U(V) C GL(V) — that is, the set ofie GL(V) such that Ap(g) = p(g)A for all g G G — consists only 
of the scalar matrices {cl | c G C}. We use this fact below. 

In the case when p is not irreducible, then, there is a nontrivial invariant subspace W C V and, as the 
inner product (•, ■) is invariant under the unitary maps p(g), it is immediate that the dual subspace 

W 1 - = {u | Vw G W, (u, w) = 0} 

is also invariant. Associated with the decomposition V = W © W is the natural decomposition of the 
operators p(g) = pw(g) ® Pw^id)- By repeating this process, any representation p : G — > li(V) may be 
decomposed into a direct sum of irreducible representations: we write p = o\ © • • • © and, for the a* 
appearing at least once in this decomposition, (Tj -< p. In general, a given a can appear multiply in this 
decomposition, in the sense that p may have an invariant subspace isomorphic to the direct sum of a a copies 
of a. In this case a a is called the multiplicity of o~i in p, and we write p = Q) a<p a a o~ where a a <r = a © • • • © a. 

If two representations p and a are the same up to a unitary change of basis, we say that they are 
equivalent. It is a fact that any finite group G has a finite number of distinct irreducible representations up 
to equivalence and, for a group G, we let G denote a set of representations containing exactly one from each 
equivalence class. The irreducible representations of G give rise to the Fourier transform. Specifically, for a 
function / : G — > C and an element p G G, define the Fourier transform of f at p to be 

V 11 gee 

The leading coefficients are chosen to make the transform unitary, so that it preserves inner products: 

</i, h) = £ /r (.g)/ 2 (<?) = E tr (A(<°) f ■ 

9 pGG 

There is a natural product operation on representations: if p : G — > U(V) and er : G — > U(W / ) arc 
representations of G, we may define a new representation ptXier : G — * U(I / (8) W) by extending the rule 
p © a(g) :u®vh p(g)u © cr(^)v. In general, the representation /? © a is not irreducible, even when both p 
and a are. This leads to the Clebsch-Gordan problem, that of decomposing p® a into irreducibles. 

For a representation p we define the character of p, denoted \ p , to be the function Xp ' G — > C given by 
Xp(g) = trp(g). As the trace of a linear operator is invariant under conjugation, characters are constant on 
the conjugacy classes of G; for a conjugacy class A = {gag^ 1 \ g G G}, wc define x{A) = x( a )- Characters 
are a powerful tool for reasoning about the decomposition of reducible representations. In particular, when 
p = ® . o~i we have Xp = Si Xo-« and, moreover, for p, cr G G, we have the orthogonality conditions 

(x P , x*)g = w E x P (g)xa(g)* = 

11 sec 

Then for an irreducible representation p and representation cr, (XpiXo-)g is equal to the multiplicity with 
which p appears in the decomposition of a. For example, since Xp<»a-(g) = Xpid) ' Xerls); the multiplicity of 
r in p©cr is (Xt,X p X<j)g 

In general, we can consider subspaces of C[G] that are invariant under left multiplication, right multipli- 
cation, or both; these subspaces are called left-, right-, or bi-invariant respectively. Each p G G corresponds 
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to a dp-dimensional bi-invariant subspace of C[G], which can be broken up further into d p d p -dimensional 
left-invariant subspaces, or (transversely) d p d p -dimensional right-invariant subspaces. However, this decom- 
position is not unique. If p acts on a vector space V, then choosing an orthonormal basis for V allows us 
to view p(g) as a d p x d p matrix. Then p acts on the d^-dimensional space of such matrices by left or right 
multiplication, and the columns and rows correspond to left- and right-invariant spaces respectively. More 
generally, each left-invariant subspace isomorphic to p corresponds to a unit vector b g V. 



3 The structure of the optimal measurement 

In this section we show that starting with a random coset state, the optimal one-register measurement for the 
hidden subgroup problem is precisely an instance of strong Fourier sampling (possibly in an over-complete 
basis). This has been pointed out several times in the past, at varying levels of explicitness |151 12U|: we 
state it here for completeness. Everything we say in this section is true for the hidden subgroup problem 
in general. However, for simplicity we focus on the special case of the hidden subgroup problem called the 
hidden conjugate problem in |25| : there is a (non- normal) subgroup H, and we are promised that the hidden 
subgroup is one of its conjugates, H 9 = g~ 1 Hg for some g £ G. 

We may treat the states arising after Step 3 of the procedure above as elements of the group algebra 
C[G]. We use the notation \g) = 1 • g G C[G] so that the vectors \g) form an orthonormal basis for C[G]. 
Given a set S C G, IS*) denotes a uniform superposition over the elements of S, \S) = (1/yjSf) Y^seS I s )- 



3.1 The optimal POVM consists of strong Fourier sampling 

The most general type of measurement allowed in quantum mechanics is a positive operator-valued measure- 
ment (POVM). A POVM with a set of possible outcomes J consists of a set of positive operators {Mj | j € J} 
subject to the completeness condition, 

« • (3-1) 

3 

Since positive operators are self-adjoint, they can be orthogonally diagonalized, and since their eigenvalues 
are positive, they may be written as a positive linear combination of projection operators (see e.g. [211 §10]). 
Any POVM may thus be refined so that each Mj = ajPj where pj is a projection operator and a., is positive 
and real. 

The result of this measurement on the state \ip) is a random variable, taking values in J, that is equal to j € 
J with probability Pj = a,j (tp\ pj \ip). Note that outcomes j need not correspond to subgroups directly; the 
algorithm designer is free to carry out a polynomial number t of experiments, observing outcomes ji,. ■ ■ ,jt, 
and then apply some additional computation to find the most likely subgroup given these observations. 

If g is chosen from G uniformly so that the hidden subgroup is a uniformly random conjugate of H , we 
wish to find a POVM that maximizes the probability of correctly identifying g from the coset state \H 9 ). 
(Of course, to identify a conjugate H 9 , we only need to specify g up to an element of the normalizer of H.) 
Since a random left coset of H 9 can be written cgH 9 = cHg for a random c € G, the probability we observe 
outcome j is 
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Ip |15| observed that in the special case that each outcome j corresponds to a subgroup, maximizing the 
probability that j is correct subject to the constraint (|3.1|) gives a semi-definite program. Since such programs 
are convex, the optimum is unique and is a fixed point of any symmetries possessed by the problem. 

However, our proof relies on an elementary "symmctrization" argument. Given a group element x £ G, 
let L x \g) = \xg) denote the unitary matrix corresponding to left group multiplication by x. In particular, 
applying L x maps one left coset onto another: \cHg) = L c \Hg). Writing 



Pj = a i (c H 9\ H \cHg) = a,j {Hg ^ l \n l <- 
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Hg 
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we conclude that replacing pj for each j with the symmetrization 



does not change the resulting probability distribution Pj. Since plj commutes with L x for every x £ G and 
provides exactly the same information as the original pj , we may assume without loss of generality that the 
optimal POVM commutes with L x for every x E G. 

It is easy to see that any projection operator that commutes with left multiplication projects onto a left- 
invariant subspace of C[G], and we can further refine the POVM so that each pj projects onto an irreducible 
left-invariant subspace. Each such space is contained in the bi-invariant subspace corresponding to some 
irreducible representation p, in which case we write im pj C p. As discussed in Section |21 a given irreducible 
left-invariant subspace corresponds to some unit vector b in the vector space V on which p acts. Thus we 
can write 



p 3 



1 



-Id 



where Irf acts within the left-invariant subspace. Let B = {hj 
ness condition for each p 6 G, 



b,es 



im p,j e p}: then (|3.1|) implies a complete- 

(3.3) 



and so B is a (possibly over-complete) basis for V. In other words, the optimal POVM consists of first 
measuring the representation name p, and then performing a POVM on the vector space V with possible 
outcomes B. Another way to see this is to regard the choice of coset as a mixed state; then its density matrix 
is block-diagonal in the Fourier basis, and so as Kuperberg puts it \2()\ measuring the representation name 
"sacrifices no entropy." 

We note that in the special case that this POVM is a von Neumann measurement — that is, when the 
B is an orthonormal basis for V — then it corresponds to measuring the column of p in that basis, which is 
how strong Fourier sampling is usually defined. (As pointed out in [5], nothing is gained by measuring the 
row, since wc have a random left coset cHg and left-multiplying by a random element c in an irreducible 
representation completely mixes the probability across the rows in each column. Here this is reflected by the 
fact that pj is a scalar in each left-invariant subspace.) However, in general the optimal measurement might 
consist of an over-complete basis, or frame, in each p, consisting of the vectors {hj} and the weights aj. 

Now that we know pj takes this form, let us change notation. Given p G G acting on a vector space 
V and a unit vector b £ V, let Il£ = |b) (b| <g> 1^ denote the projection operator onto the left-invariant 
subspace corresponding to b. Then pj = , and l|3.2[) becomes 



p j = a 3 



1 

W\ 



ceG 



\cHg) 



n b. 



\Hg) 



(3.4) 



We can write this as the product of the probability P{p) that we observe p, times the conditional probability 
P(p, hj) that we observe by. Note that by (|3.3|) . 



IP 



b,es 



is the projection operator onto the bi-invariant subspace corresponding to p. Then 

Pj = P(p)P(p,b j ) 

where 



P{p) = \\W\H) 



P(p,h 3 



K 3 \Hg) / P( P ) ■ 



(3.5) 
(3.6) 
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Note that P(p, bj) depends on g but P(p) does not, which is why weak sampling is incapable of distinguishing 
conjugate subgroups. 



3.2 The probability distribution for a conjugate subgroup 

Now let us use the fact that \H) is a superposition over a subgroup, and calculate P(p) and P(p, bj) as 
defined in (|3.5() and (|3.6|) . This will set the stage for asking whether we can distinguish different conjugates 
of H from each other or even the trivial subgroup. 

Fix an irreducible representation p that acts on a vector space V. Then Fourier transforming the state 



\H) = 



1 



\H 



£i*> 

heH 



yields the coefficient 



H(P) = 



\H\\G\ 



ld P \H\ n 



heH 



\G\ 



H 



where Hh = (1/\H \) Y^heH PV 1 ) 1S a projection operator onto a subspace of V. The probability that we 
observe p is then the norm squared of this coefficient, 



P{p)= H(p) 



IGI rk 



(3.7) 



and, as stated above, this is the same for all conjugates H 9 . The conditional probability that we observe the 
vector bj, given that we observe p, is then 



P(p,hj) 



H(p)b 3 



a 



1 rkn H 



P(p) J P(p) 

In the case where H is the trivial subgroup, II# = Ij and P(p, bj) is given by 



(3.8) 



(3.9) 



We call this the natural distribution on the frame B = {bj}. In the case that B is an orthonormal basis, 
cij = 1 and this is simply the uniform distribution. 

This probability distribution over B changes for a conjugate H 9 in the following way. Again ignoring left 
multiplication since the columns arc left-invariant, the Fourier transform becomes 



Hg(p) 



ld \H\ 



and we have 



P(p,bj) 



n g ffbj 
rkn 



H 



where we write gh for p(g)b. It is not hard to show that, for any b £ V, the expected value of \\Hh (<?b) | , 
over the choice of g £ G, is rk Tin/dp. Our primary technical contribution is a method for establishing 
concentration results for this random variable. 
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4 The variance of projection through a random involution 



In this section we focus on the case where H = {1, m} for an element m chosen uniformly at random from 
a conjugacy class [m] of involutions. (Observe that order is preserved under conjugation so that if m is an 
involution, then so are all elements of [m].) Given an irreducible representation p : G — » U(V) and a vector 
b G V, we bound the variance, over the choice of m, of the probability P(p, b) that b is observed given 
that we observed p. Our key insight is that this variance depends on how the tensor product representation 
p®p* decomposes into irreducible representations a, and how the vector b<g>b* projects into the constituent 
orthogonal irreducibles. 

Recall that, if a representation p is reducible, it can be written as an orthogonal direct sum of irreducibles 
p = ® a ^ p a a a where a a is the multiplicity of a. We let II£ denote the projection operator whose image is 
a CT cr, that is, the span of all the irreducible subspaces isomorphic to a. 

Lemma 1. Let p be a representation of a group G acting on a space V and let b G V. Let m be an element 
chosen uniformly from a conjugacy class [m] of involutions. If p is irreducible, then 



If p is reducible, then 



Exp m (b,mb)^^Ml||b|| 2 



Exp m (b,mb) = X^M||n^b|| 2 



a<p 

Proof. Fix a particular element p G [to]. Let p([m]) denote the average of p(m) over [m]; this is 

P(M) = TF-TT P{™) = |77| X P^W) = T7T\Y1 ^Mm) P{9) ■ 

l[m|l me[m] 1 1 geG ' ' g£G 

Observe that p([m]) commutes with p(g) for all g G G and hence, by Schur's lemma, its action on any 
irreducible subspace is multiplication by a scalar. Note that for the scalar linear operator A = aid acting 
on a space of dimension d, we have a = tr A/d. In particular, if p is irreducible then 

p(H) = ^U p 

dim p p 

and so 

Exp m (b,mb)=(b,p(H)b) = ^^||b|| 2 . 
If p is reducible, these same considerations apply to each irreducible subspace: 

dim a 



□ 



a-<p 

and so 

Exp m (b,mb) = (b,p(H)b) = J2 ^W^b) = J2 IIIWI 2 

* — ' dim a * — ' dim a 

<y-<p cr-ip 

Turning now to the second moment of (b, mb), we observe that 

|(b,mb)| 2 = (b, mb)(b, mb)* = (b ® b*, mb g) mb*) = (b g) b*, m(b ® b*)), 

where the action of m on the vector h<S> b* is precisely given by the action of G in the representations p® p* . 
This will allow us to express the second moment of the inner product (b, mb) in terms of the projections of 
b (g) b* into the irreducible constituents of the tensor product representation p® p* . 
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Lemma 2. Let p be a representation of a group G acting on a space V and let b G V . Let m be an element 
chosen uniformly at random from a conjugacy class [m] of involutions. Then 



Exp m |(b,mb) 



E 



a^p^ip" 



dirtier 



Proof. We write the second moment as a first moment over the product representation p ® p*: as above, 
|(b,mb)| 2 = (b(g)b*,m(b® b*)}, so that 

Exp m |(b,rab)| 2 = Exp m (b® b*,m(b® b*)) 

and applying Lemma ^completes the proof. □ 

Now let n m = T\h denote the projection operator given by 



11™. v 



For a given vector b G B, we will focus on the expectation and variance of ||n m b||". These are given by the 
following lemma. 

Lemma 3. Let p be an irreducible representation acting on a space V and let b G V . Let m be an element 
chosen uniformly at random from a conjugacy class [m] of involutions. Then 



Ex Pm ||n m b|| 2 = i|| b f(i + *«) 

x>(M) 



b|| 2 < - E 



4 ^—-t dim a 



Tlf" (b<g>b*) 



(4.1) 
(4.2) 



Proof. For the expectation, 



Exp m ||II m b| 



Exp m (b,n m b) 
-Exp m «b,b) + <b,mb» 



where the last equality follows from Lemma ^ 

For the variance, we first calculate the second moment, 

Exp m ||n m b|| 4 = Ex Pm |(b,n m b)| 2 

= iExp m |(b,b) + (b,mb)| 2 
1 



= - Exp m ( | (b, b) | 2 + 28*(b, b) (b, mb) + | (b, mb) | 2 



4 " b 



|4 + 2 || b ||4Xp(N) 



dim p 



E 

<r-!p<g>p* 



x«t(H) 

dim a 



Yf® p ' (b®b*) 



where in the last line we applied Lemmas ^ and 121 an d the fact that any character evaluated at an involution 
is real. Then 



Var m ||n m b|| = Exp m ||n m b|| - (Exp m ||II m b|| 

Xp(M) 



E 



dim p 



Tlf" (bob*) 



-llbl 



( x P (H) 

\ dim p 



Ignoring the second term, which is negative, gives the stated result. 



(4.3) 
□ 
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Finally, we point out that since 



we have 



a fact which we will use below. 



Ex Pm ||n m b!| 2 = ||b|| 2: 



rk IL 



dim p 



- = l(l+ Xp{[m]) 
dimp 2 \ dimp 



(4.4) 



5 The representation theory of the symmetric group 

In this section we record the particular properties of S n and its representation theory applied in the proofs 
of the main results. The irreducible representations of S n are labeled by Young diagrams, or equivalently by 
integer partitions of n, 

A=(Ai,...,A t ) 

where A; = n and A; > A; + i for all i. The conjugate Young diagram A' is obtained by flipping A about 
the diagonal: A' = (X[, . . . , X' Xi ) where A^ = \{i | A; > j}\. In particular, X[ = t. The number of such 
diagrams, equal to the number of conjugacy classes in S n , is the partition number p(n), which obeys 



p(n) = (l + o(l)) 



1 



4v^3 • ■ 



We denote these irrcducibles S x , their characters x A , and their dimensions d x . In particular, S x is the trivial 
or parity representation if A is a single row (n) or a single column (1, . . . , 1) respectively, and S x is the 
(tensor) product of S x with the parity representation. 

The dimensions of the representations of the symmetric group are given by the remarkable hook length 
formula: 

d x = 



n c hook(c) ' 

where this product runs over all cells of the Young diagram associated with A and hook(c) is the number of 
cells appearing in either the same column or row as c, excluding those that are above or to the left of c. 

For example, the partition A = (Ai, A2, A3, A4) = (6,5,3,2) is associated with the diagram shown in 
Figurcnbclow. The hook associated with the cell (2, 2) in this diagram appears in Figurc|21 it has length 6. 



r 



P 



P 



Figure 1: The Young diagram for A = (6, 5, 3, 2). 



Figure 2: A hook of length 6. 



The symmetric groups have the property that every representation S x possesses a basis in which its 
matrix elements are real, and so all its characters are real. However, in a given basis S x might be complex, 
so we will refer below to the complex conjugate representation (S x )* (not to be confused with S x ). 
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The study of the asymptotic properties of the representations of S n typically focuses on the Plancherel 
distribution. (See, e.g., Kerov's monograph for further discussion.) For a general group G, this is the 
probability distribution obtained on G by assigning p the probability density d 2 p /\G\. One advantage of 
this distribution is that the density at p is proportional to its contribution, dimensionwise, to the group 
algebra C[G]; this will allow us to reduce structural questions about representations chosen according to 
this distribution to questions about the regular and conjugation representation. Note that in the context of 
the hidden subgroup problem, the Plancherel distribution is exactly the one obtained by performing weak 
Fourier sampling on the trivial hidden subgroup. 

In the symmetric groups a fair amount is known about representations chosen according to the Plancherel 
distribution. In particular, Vershik and Kerov have shown that with high probability they have dimension 
equal to e e ^^>VrJ.. 

Theorem 4 ([34 ). Let S x be chosen from S n according to the Plancherel distribution. Then there exist 
positive constants C\ and ci for which 



lim Pr 

n — >oo 



e- Cl ^V^! < d x < e~ c ^\fn~\ 



1 



Vershik and Kerov have also obtained estimates for the maximum dimension of a representation in S n : 
Theorem 5 (|_34j). There exist positive constants c and c such that for all n > 1, 

e-^Vri < max d x < e'^V^. . 
s A es„ 

Along with these estimates, we shall require some (one-sided) large-deviation versions of Theorem 
recorded below. 



Lemma 6. Let S x be chosen according to the Plancherel distribution on S n 



< e 



1. Let 8 = 7Ty/2/3. Then for sufficiently large n, Pr 

2. LetO<c< 1/2. Then Pr[d x < n cn ] = n - n ( n ) . 

Proof. For the first bound, setting d = e~ 5 v™\/n\ and using p(n) < e s ^™, we have 

(d x ) 2 .d 2 



* — ' n\ n 



S x :d x <c 

For the second bound, recalling Sterling's approximation n\ ~ yhvnin / e) n , we have 



(l-2c)n O(n) _ „-n(n) 



n ' e 



□ 



Finally, we will also apply Roichman's [2Hi estimates for the characters of the symmetric group: 

Definition 1. For a permutation tt £ S n> define the support of n, denoted supp(7r), to be the cardinality of 
the set {k G [n] \ ir(k) ^ k}. 

Theorem 7 ([28J). There exist constants b > and < q < 1 so that for n > 4, for every conjugacy class 
C of S n , and every irreducible representation S x of S n , 



X X (C) 



Ai a; 



< max I q, 

n n 



b-supp(C) 



d X 

where supp(C) = supp(7r) for any tt £ C. 

In our application, we take n to be even and consider involutions m in the conjugacy class of elements 
consisting of n/2 disjoint transpositions, M = AI n = {a ((12)(34) ■ ■ • (n — 1 n)) c -1 | a £ S n }. Note that 
each m € M n is associated with one of the (n — 1)!! perfect matchings of n things, and that supp(m) = n. 
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6 Strong Fourier sampling over S n 

We consider the hidden subgroup H = {l,m}, where m is chosen uniformly from M = M n C S n , the 
conjugacy class 

{^- 1 ((12)(3 4)---(n-ln))^|^G5„} ; 

we assume throughout that n is even. We start by measuring the name of an irreducible representation, 
yielding S x for a diagram A. We remark that Hallgren, Russell, and Ta-Shma ^2] established that the 
probability distribution on A is exponentially close to the Planchcrcl distribution in total variation. We allow 
the algorithm designer to choose an arbitrary POVM, with a frame B = {hj} and weights {aj} obeying 
the completeness condition (13. 3|) . We will show that with high probability (over m and A), the conditional 
distribution induced on the vectors B is exponentially close to the natural distribution l|3.9fl on B. It will 
follow by the triangle inequality that it requires an exponential number of single-register experiments to 
distinguish two involutions from each other or, in fact, distinguish H from the trivial subgroup. 

For simplicity, and to illustrate our techniques, we first prove this for a von Neumann measurement, i.e., 
where B is an orthonormal basis for S x . In this case, we show that the probability distribution on B (or 
equivalcntly, on the columns of S x ) is exponentially close to the uniform distribution. 



6.1 von Neumann measurements 

Theorem 8. Let B = {b} be an orthonormal basis for an irreducible representation S x . Given the hidden 
subgroup H = {l,m} where m is chosen uniformly at random from M , let P m (b) be the probability that we 
observe the vector b conditioned on having observed the representation name S , and let U be the uniform 
distribution on B. Then there is a constant S > such that for sufficiently large n, with probability at least 
1 — e~ Sn in m and A, we have 

WPm-UW.Ke^ 71 . 

Proof. First, recall from 1)3. 8fl in Section [3] that the conditional distribution on B is given by (since ctj = 1) 

P m (b)=P(S\b) = &£C . (6.1) 

Our strategy will be to bound Var m ||IT m b|| 2 using LemmaEl and apply Chebyshev's inequality to conclude 
that it is almost certainly close to its expectation. Recall, however, that our bounds on the variance of 
||n m b|| 2 depend on the decomposition of S x <S> (S x )* is into irreducibles and, furthermore, on the projection 
of b (g> b* into these irreducible subspaces. Matters are somewhat complicated by the fact that certain 5 M 
appearing in S x ® (S x )* may contribute more to the variance than others. While Theorem [7] allows us to 
bound the contribution of those constituent irreducible representations for which /zi and fi^ are much 
smaller than n, those which violate this condition could conceivably contribute large terms to the variance 
estimates. Fortunately, we will see that the total fraction of the space S x (£> (S x )*, dimensionwise, consisting 
of such is small with overwhelming probability. Despite this, we cannot preclude the possibility that for a 
specific vector b, the quantity Var ||IT m b|| 2 is large, as b may project solely into spaces of the type described 
above. On the other hand, as these troublesome spaces amount to a small fraction of S x ® (5* A )*, only a few 
b can have this property, and we will see that this suffices to control the distance in total variation to the 
uniform distribution. 

Specifically, let < c < 1/4 be a constant, and let A = A c denote the collection of Young diagrams \x 
with the property that either /ii > (1 — c)n or fi[ > (1 — c)n. We have the following upper bounds on the 
cardinality of A and the dimension of any S 1 ^ with fi G A: 

Lemma 9. Let p{n) denote the number of integer partitions of n. Then |A| < 2cnp(cn), and d^ < n cn for 
any ji £ A. 
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Proof. For the first statement, note that removing the top row of a Young diagram p with fii > (1 — c)n 
gives a Young diagram of size n — pi < cn. The number of these is at most p(cn), and summing over all 
such fix gives cnp(cn). The case p\ > (1 — c)n is similar, and summing the two gives |A| < 2cnp(cn). 

Now let fj, € A with /xi > (1 — c)n. By the hook-length formula, since the ith cell from the right in the 
top row has hook(c) > i, < n\/ p\\ < n\/((l — c)n)\ < n cn . The case p! x > (1 — c)n is similar. □ 

As a result, the representations associated with diagrams in A constitute a negligible fraction of S n ; 
specifically, from Lemma El part the probability that a A drawn according to the Plancherel distribution 
falls into A is n~ n< ~ n \ The following lemma shows that this is also true for the distribution P(p) induced on 
S n by weak Fourier sampling the coset state \H). 

Lemma 10. Let d < 1/2 be a constant and let n be sufficiently large. Then there is a constant 7 > such 
that we observe a representation S x with d x > n dn with probability at least 1 — n~ ln . 

Proof. The proof is nearly identical to that of LemmaEl and follows an argument from Recall from l|3.7(l 
in Section that we observe an irreducible p with probability P(p) = (d p \H\/\G\)r\i tie- Given \H\ = 2, 
|G| = n\, and rk tth < d p , we have P(p) < 2d 2 /n\. Since the total number of irreducibles of S n is p(n) and 
nl > n n e~ n , we have 

P{S X ) < 2p(n)n 2dn /nl < 2p{n)e n n {2d - 1)n 

S x :d x <n dn 

and setting 7 < 1 — 2d completes the proof. □ 
On the other hand, for a representation with p ^ A, Theorem implies that 



X "(M) 



d>> 



< (max( g ,l-c)) b " <e- an (6.2) 



for a constant a > be > 0. Thus the contribution of such an irreducible to the variance estimate of Lemma 
is exponentially small. In addition, let Eq be the event that d x > n dn as in Lemma ^3 then conditioning on 
E and setting d such that c < 1/4 < d, Lemma implies that A ^ A, and Equations (|4.4|) and i|tj.2fl imply 

y (1 - e- an ) < rk n m < y (l + e~ an ) . (6.3) 

We turn now to the problem of bounding the multiplicities with which representations S^, for p £ A, 
can appear in S x ® (<S* A )*. While no explicit decomposition is known for <8> (S x )*, the endomorphism 
representations of S n , we record a coarse bound below which will suffice for our purposes. Recall that 
character of S x (8> (S x )* is \ X ' (x A )* = (x A ) 2 as characters of S n arc real. The multiplicity of the representation 

in S x g) (S x )* is (x M , (x A ) 2 )g- However, this is equal to (x >1 X X ,X X )g, the multiplicity of S x in the 
representation S 11 <£> S x . Counting dimensions, this is clearly no more than dim(5 M ® S x )/ dimS' A = d^. 
Hence the multiplicity of in S x (g) (S x )* is never more than c? M ; we have 

(X M ,(X A ) 2 ) G <^ ■ (6-4) 

Let L C S x ® (5" A )* be the subspace consisting of copies of representations with p E A, and let 
be the projection operator onto this subspace. By Lemma El we have 



dimX < ^(#) 2 < 2cnp(cn)n 2cn = e^^n 2 ™ 



Note by Lemma llUI with d > 2c, dimL is a vanishingly small fraction of d x . 

As B is an orthonormal basis for S x , the vectors {h<E> b* | be B} arc mutually orthogonal in S x <E) (S x )*. 
Therefore, 

Y, ||n L (b®b*)i| 2 < dimX . 

heB 
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In particular, if Bl C B denotes the set of b with the property that 



||II L (b®b*)|| 2 > e - an , 

then, conditioning on Eq, \Bl\ cannot be larger than e an dimL = e °( n ) n 2cn — n~°( n )<i A . For a basis vector 
b ^ Bl, we apply Lemma[!|]to bound the variance as follows: assuming pessimistically that x M (M)/d'' = 1 
for all (J, e A, and applying (|6.2|) . gives 



Var m |in m bir < - 



1 

< - 
~ 4 



J2 \K(b®h*)\\ 2 + J2 ^M||n^(b®b*) 112 



(ll< 



-an +e -an \\U X {b(g,h*) 
S" :/i£A 



< - e' an 
~ 2 



In this case, by Chebyshev's inequality 

Pr 



n m b|| 2 - Exp m ||n m b|| 2 



> e 



-an/3 



< e ~ an/3 . 



(6.5) 



We say that a basis vector is bad if this bound is violated, i.e. 



|n m b|| 2 -Ex Pm ||n m b| 



Let .Bbad denote the subset of B consisting of bad basis vectors (observe that while Bl depends only on the 
choice of A, Sbad depends also on m). Then (|6.5J) implies 

Exp m |S bad | <e~ an ' 3 d x . 

Now let Ei be the event that 

|Bbad| < e- Q " /6 d A ; 

then by Markov's inequality, E\ occurs with probability at least 1 — e~ Q ™/ 6 . 

Now let us separate \\P m ~U\\ 1 into contributions from basis vectors outside and inside Bl U -Bbad: 

\\P m -U\\ 1= ]T \P m (b)-U\+ Yl \ p M-U\ . (6.6) 
The first sum is taken only over vectors b for which 



|n m b|| 2 -Ex Pm ||n m b|| 2 



< e 



— an/3 



(6.7) 



Conditioning on Eq and recalling that P m (h) = ||n m b|| 2 /rkll m , the rank estimate of l|6.3[l gives 

\P m (b) -U\< —— ■ d x < < 4e' an / 3 . 

1 v ; 1 " rkn m ~ 1 -e- an 

b^iUB bad 

It follows that P m (B L U 5 bad ) is at most \B L U -B bad |/d A + 4e~ Q "/ 3 . Therefore, since conditioning on Eq and 
E x we have \B L U -B bad | < (n-° (n) + er an / & )d x < 2e~ an ^ e d x 1 the second sum in (£3 is at most 



E 



p m (b) ^ 



< P m (B L U -Bbad) + lBL ^f badl < 4e- Q "/ 6 + 4e-""/ 3 < be~ an ' G . (6.8) 



beB L UB bad 

Then combining (j(H>j) . (|6~7j) and (|6~%|) . 

\\P m - < 4e- an / 3 + 5e- Q ™/ 6 < 6e~ an/6 
with probability at least Pr[-E A E{\ > 1 — n -7 ™ - e - an/6 > 1 - 2e~ Qn/6 . We complete the proof by setting 



5 < a/6. 



□ 
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6.2 Arbitrary POVMs 

We now generalize the proof of Theorem [S| to the case where the algorithm designer is allowed to choose an 
arbitrary finite frame B = {b} of unit length vectors in S x , with a family of positive real weights a^, that 
satisfy the completeness condition 

5>„ |b> <b| = 1 . (6.9) 

b 

(Note that this is simply i|3.3[) where wc have written b and ab instead of hj and aj.) 

Theorem 11. Let B = {b} be a frame with weights {ab} satisfying the completeness condition (|6.9() for 
an irreducible representation S x . Given the hidden subgroup H = {l,m} where m is chosen uniformly at 
random from M , let P m (h) be the probability that we observe the vector b conditioned on having observed 
the representation name S , and let N be the natural distribution l|3.9|l on B. Then there is a constant 5 > 
such that for sufficiently large n, with probability at least 1 — e~ Sn in m and \, we have 

\\P m -N\\ 1 <e- 5n . 

Proof. Recall from <|3.8[) in Section [3] that the conditional distribution on B is given by 



P m (b) = P(S\h) = a h 



|n m b|| 2 

rk n m 



and the natural distribution (|3.9(l is given by A^(b) = ab/d x . 

The proof of Theorem |H1 goes through with a few modifications. First, let us change some semantics: 
given a subset A C B, we let |^4| denote the weighted size of A, 

\A\ = J2 Ob ■ 

With this definition, the total probability that falls in A under the natural distribution is N(A) = |^4|/d A . 
Then we will use the following lemma: 

Lemma 12. Let L be a subspace of S x <8> (S A )* and let Tlr, be the projection operator onto L. Then 

^a b ||n L (b®b*)|| 2 <dimL . (6.10) 

bGB 

Proof. First note that a vector e S S x (g> (S x )* has entries e^fc for 1 < j,k < d A . There is a unique linear 
operator E on S x whose matrix entries are Ej,k = ej.k, and the inner product (b g) b*, e) in S x (£> (S x )* can 
then be written as the bilinear form (h,Eh) in S x . The Frobenius norm of E is ||£|| 2 = trE^E = ||ej| 2 . 
Now let {e^} be an orthonormal basis for L and let Ei be the operator corresponding to e^. Then 

"obKb®b* >ei >| a = ^a b |<b,^b)| 2 < ^ ab ||b|| 2 ||^b|| 2 = ^ a b ||^b|| 2 



b£B bS-B bSB heB 



"a b tr (El\b) (b\Ei) =tr 



beB 




" a h |b) (b| Ei 



= tr E}Ei = ||e ?; II 2 = 1 



where we used the Cauchy- Schwartz inequality in the second line and completeness in the second. Summing 
over the dim L basis vectors e< then gives l|6.10|l . □ 

We define A and Eg as before, and Lemmas I§1 and UHl still apply. As before, let L C S x ® (S x )* be the 
subspace consisting of copies of representations with fj, € A, and let Bl C B denote the set of b with the 
property that 

||n L (b<g)b*)|| 2 >e 



-an 
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Then Lemma IT21 implies that 

\B L \< e ml dimL 

where \Bi] is defined as above. We again define -Bbad as the set of b £ B \ Bl such that 

|n m b|| 2 -Ex Pm ||n m b|| 2 | > e ~ an/3 

and Chebyshev's and Markov's inequalities imply that the event E\, namely 

l^badl < e- an ' 6 d x , 

occurs with probability at least 1 — e~ Q ™/ 6 . 

We separate \\P m — N\\ x as we did \\P m — U\\ 1 before: 

11^-^111= E \P m (*)-N(b)\ + Y, l^(b)-^(b)| • (6.11) 

bPtUBbad beB L UB bad 



Since 



|n m b|| 2 -Ex Pm ||n in b|| 2 



< e Q ™/ 3 for all b ^ Bl U -Bbad, and since X)b ab = conditioning on 
Eg and using (|6.3() bounds the first sum as follows, 

E l p ™( b ) ~ ^( b )l < TkTu ' d " - T3^=r < 4e " W3 • ^ 6 - 12 ) 

bPtUB, Jad 

It follows that P m {BL U -Bbad) is at most U i?bad|/d A + 4e _cm / 3 . Conditioning on Eq and .Ei, we have 
\B L U Sbadl < (n~° (n) + e~ Q ™/ 6 )d A < 2e" Q "/ 6 d A . Thus the second sum in (fB~TT|) is at most 

E l p ™( b ) - ^( b )l ^ u S bad) + N(B L U Bbad) < 4e- Q ™/ 6 + 4e" Q "/ 3 < be' an ' & . (6.13) 

bG-BiU,B bad 

Then combining ijfTTTJl . (|B~T2"|) and JHUSJ, 

||P m - JV|| X < 4e~ Qn/3 + 5e- a ™/ 6 < 6 e - Qn/6 

with probability at least Pr[E A-Ei] > 1 - n~^ n - er an/6 > 1 - 2e~ Qn/6 . We complete the proof by setting 
S < a/6 as before. □ 



7 Structured involutions and the case of Graph Isomorphism 

The preceding development focuses on the case where the hidden subgroup is distributed uniformly among 
the conjugates of the subgroup H = {l,m}. As such, this shows that the canonical reduction of Graph 
Automorphism (the problem of determining whether a given graph has a non-trivial automorphism) to 
the hidden subgroup problem does not, in general, give rise to an efficient quantum algorithm via Fourier 
sampling. 

However, the canonical reduction of GRAPH Isomorphism to the hidden subgroup problem induces a 
more structured set of involutions. As referred to in the Introduction, fixing two rigid graphs G\ = (Vi, E\) 
and G<x = (V2, E2), each with n vertices, the automorphism group of their disjoint union (Vi U V2, E\ U E2) 
is nontrivial exactly when they are isomorphic, in which case it is generated by an involution m with full 
support such that m(Vi) = V2. Identifying V\ and V2 with the sets {1, . . . , n} and {n+1, . . . , 2n} respectively, 
and letting s denote the involution (ln + l)(2n + 2)...(n-l 2n), the standard reduction to the hidden 
subgroup problem in 5*2 ra then results in a hidden subgroup H = {l,m} where m is a conjugate involution 
a~ 1 sa. However, rather than a being drawn from all of S^ra) it is an element of the Young subgroup S n „ 
which fixes V\ and V2'. 

S n ,n = {tT £ S 2 n I 7r({l, . . . , n}) = { 1 , . . . , 7l}} = S„ X S n . 
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In other words, rather than considering all conjugates of m in S^nj it suffices just to consider conjugates 
in S ntn . A priori, it seems that this smaller set of possible hidden subgroups might be easier to identify. 
Moreover, let K be the subgroup generated by S n _ n and s: this is the wreath product S n which can 
also be written as a semidirect product K — (S n x S„) x Z2. Then each such H is contained in K, and it 
seems that it might be more intelligent to Fourier sample over K rather than over all of Sin- 

However, we can show that nothing is gained by this approach. First, note that the involutions described 
above form the (if-)conjugacy class 

{((a, a" 1 ),!) G (S n x Sn) x Z 2 | a G S„} . 

We remark the that the development of Section [3] is unchanged, and that the optimal measurement to find 
a hidden conjugate again consists of strong Fourier sampling. 

Now note that Fourier sampling over S^n and over K are equivalent for the following reason: suppose we 
are trying to distinguish a set of hidden subgroups Hi c G, all of which are contained in a subgroup K C G. 
Let T be a set of representatives for the cosets of K. Then a random left coset of Hi in G is the product of 
a random left coset of Hi in K with a random clement of T . Thus the mixed state describing a uniformly 
random coset of Hi in G can be written as the corresponding state over K with the completely mixed state 
over T . Since this completely mixed state (whose density matrix is the identity) contains no information, 
nothing is gained (or lost) by sampling over all of G rather than over K. 

Finally, we observe that one can obtain easy character bounds for this conjugacy class of K, One 
determines if's irreducible representations and their characters as follows. For two irreducible representations 
p and a of S n , let pM a denote their tensor product as a representation of S n>n = S n X S n . We consider the 
induced representation Indf^ n (plE <r) (see |31p and denote its character X{p.a}- It is easy to see that 

... a , ,s fo if * = 1 

X{ P ,a}{{{a,P),t)) = S , , m s, , , (ff , .,, n 

as the notation suggests, this depends only on the multiset {p, a}. Observe that if A and B are two distinct 
multisets of representations of S n (each of cardinality two) then the characters xa and xb are orthogonal. 
Furthermore, an easy computation shows that (x{p,a} 1 X{p,a} ) — 1 + bp,a and hence that X{p,a} IS irreducible 
if p ^ <j, while if p = a then the induced representation is the direct sum of two irreducible representations, 

Indf„ n (p Kl p) ^ (p IE p <g) 1) (p B p ® 7r) 

where 1 and tt are the trivial and sign representations, respectively, of Z2. Each of these irreducible repre- 
sentations acts on V p (8 V p , the vector space supporting the action of p M p. Both of them realize the element 
((a, P), 0) as the linear map p{a) ® p(f3), while pM p®t and p M p ® n realize the element ((1, 1), 1) as the 
maps which send u ® v to v ® u and — v <X> u respectively. The characters of these representations can be 
determined by inspection; in particular, 

XpH + XpW ift = 0, . _ jx P {a)+Xp(P) ift = 0, 

I a\ it! 1 X{p,p},n\\ a i P)i l ) — ) t a\ •( , , 

X P (.a(i) ifi=l, [-Xp(a^) if t = 1 , 

where X{p,jo},i and X{p,p},tt denote the characters of p M p ® 1 and p IE p ® 7r respectively. 

In light of this description of the irreducible characters of K , one finds that x<r,p((a, a -1 ), 1) = when 
<t ^ p and that |XpKlp,A(( a ) a -1 ), 1))| = d p for A G {1, tt}. In particular, the absolute value of the normalized 
character at p Mp<g> A is l/d p . As before, the dimensions d p of the diagrams with large normalized character 
sum to very little, and the analysis of Section [B] can be undertaken mutatis mutandis. Remarkably, this can 
be carried out without use of Roichman's estimates! 
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